Enhancing Protein Fold Prediction Accuracy Using an Ensemble of Different Classifiers
نویسندگان
چکیده
Protein fold prediction problem is considered as a key point to protein structure recognition and structural discoveries. Recent advances in pattern recognition field brought a great interest to apply pattern classification techniques to tackle this problem. From the pattern recognition point of view, the protein fold prediction problem can be expressed as a multi-class classification task that can be solved by using feature selection, feature extraction and classification approaches. In this paper, a new classifier ensemble, based on combination of five different classifiers, namely: Naïve Bayes, Multi Layer Perceptron (MLP), Support Vector Machine (SVM), LogitBoost, and AdaBoost.M1 combined with five combinational policies (Average of Probabilities, Product of Probabilities, Minimum of Probabilities, Maximum of Probabilities and Majority Voting), is proposed to tackle this problem. To study the effectiveness of the proposed method and compare our results with previously reported results, the dataset produced by Ding and Dubchak is used. Experimental results show that the proposed method (using majority voting as its combinational policy) enhances the protein fold prediction accuracy better than most of the other classification methods found in the literature, using the same set of features (employed by Dubchak et al.).
منابع مشابه
A novel hybrid method for vocal fold pathology diagnosis based on russian language
In this paper, first, an initial feature vector for vocal fold pathology diagnosis is proposed. Then, for optimizing the initial feature vector, a genetic algorithm is proposed. Some experiments are carried out for evaluating and comparing the classification accuracies which are obtained by the use of the different classifiers (ensemble of decision tree, discriminant analysis and K-nearest neig...
متن کاملApplication of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
متن کاملMargin-based ensemble classifier for protein fold recognition
Recognition of protein folding patterns is an important step in protein structure and function predictions. Traditional sequence similarity-based approach fails to yield convincing predictions when proteins have low sequence identities, while the taxonometric approach is a reliable alternative. From a pattern recognition perspective, protein fold recognition involves a large number of classes w...
متن کاملFault Detection of Bearings Using a Rule-based Classifier Ensemble and Genetic Algorithm
This paper proposes a reduct construction method based on discernibility matrix simplification. The method works with genetic algorithm. To identify potential problems and prevent complete failure of bearings, a new method based on rule-based classifier ensemble is presented. Genetic algorithm is used for feature reduction. The generated rules of the reducts are used to build the candidate base...
متن کاملClassifier Ensemble Framework: a Diversity Based Approach
Pattern recognition systems are widely used in a host of different fields. Due to some reasons such as lack of knowledge about a method based on which the best classifier is detected for any arbitrary problem, and thanks to significant improvement in accuracy, researchers turn to ensemble methods in almost every task of pattern recognition. Classification as a major task in pattern recognition,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Austr. J. Intelligent Information Processing Systems
دوره 10 شماره
صفحات -
تاریخ انتشار 2009